Getting More from the Singular Value Decomposition (SVD): Enhance Your Models with Document, Sentence, and Term Representations
نویسندگان
چکیده
Since its inception, SAS® Text Miner has used the singular value decomposition (SVD) to convert a termdocument matrix to a representation that is crucial for building successful supervised and unsupervised models. In this presentation, using SAS® code and SAS Text Miner, we compare these models with those that are based on SVD representations of subcomponents of documents. These more granular SVD representations are also used to provide further insights into the collection. Examples featuring visualizations, discovery of term collocations, and near-duplicate subdocument detection are shown.
منابع مشابه
Graph Clustering by Hierarchical Singular Value Decomposition with Selectable Range for Number of Clusters Members
Graphs have so many applications in real world problems. When we deal with huge volume of data, analyzing data is difficult or sometimes impossible. In big data problems, clustering data is a useful tool for data analysis. Singular value decomposition(SVD) is one of the best algorithms for clustering graph but we do not have any choice to select the number of clusters and the number of members ...
متن کاملSignificant Sentence Extraction by Euclidean Distance Based on Singular Value Decomposition
This paper describes an automatic summarization approach that constructs a summary by extracting the significant sentences. The approach takes advantage of the cooccurrence relationships between terms only in the document. The techniques used are principal component analysis (PCA) to extract the significant terms and singular value decompostion (SVD) to find out the significant sentences. The P...
متن کاملFeature Extraction of Visual Evoked Potentials Using Wavelet Transform and Singular Value Decomposition
Introduction: Brain visual evoked potential (VEP) signals are commonly known to be accompanied by high levels of background noise typically from the spontaneous background brain activity of electroencephalography (EEG) signals. Material and Methods: A model based on dyadic filter bank, discrete wavelet transform (DWT), and singular value decomposition (SVD) was developed to analyze the raw data...
متن کاملClustered SVD strategies in latent semantic indexing
The text retrieval method using Latent Semantic Indexing (LSI) technique with truncated Singular Value Decomposition (SVD) has been intensively studied in recent years. The SVD reduces the noise contained in the original representation of the term-document matrix and improves the information retrieval accuracy. Recent studies indicate that SVD is mostly useful for small homogeneous data collect...
متن کاملSynthesis, characterization and spectroscopic properties of new azo dyes derived from aniline derivatives based on acetylacetone and azo-metal (II) complexes and singular value decomposition (SVD) investigation
Four new azo-dyes, 3-phenyl azopentane-2,4-dion (LA), 3-(4-nitro phenyl azo)-pentane-2,4-dion (LP), 3-(2-nitro phenyl azo)-pentane-2,4-dion (LO) and 4-(1-acetyle-2-oxo-propyl azo)-benzene sulfonate sodium (LS), were synthesized from, aniline, 4-nitroaniline, 2-nitroaniline and sulfanilic acid with acetylacetone, respectively. Reaction of these new dyes with acetate salts of copper(II), nickel(I...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016